Importance of intelligible phonemes for human speaker recognition in different channel bandwidths
Abstract
It is known that nasal consonants and vowels are more effective than other phonemes for human speaker recognition. However, the influence of channel transmissions on the speaker-discriminative capabilities of phonemes has not yet been examined. In particular, the speech bandwidth has a strong effect on both human speaker recognition performance and speech intelligibility. This study determines which phonemes permit more accurate human speaker recognition by means of a speaker verification auditory test, focusing on the differences in performance when the stimuli are presented to listeners in narrowband and in wideband. Speech intelligibility is also investigated via an intelligibility test employing the same speech stimuli. Finally, the possible relationship between the phonemes that offer better human speaker recognition and those that become more intelligible in the transition to an enhanced bandwidth is discussed.
Similar resources
I-vector speaker verification based on phonetic information under transmission channel effects
Past studies have shown evidence of important speaker-specific content in the higher frequencies of the spectrum, which are filtered out by narrowband channels. Besides, wideband transmissions, which are gaining ground over narrowband communications, offer an extended range of frequencies which account not only for better speech quality and intelligibility, but also for an improved speaker recog...
Predicting consonant intelligibility in normal-hearing listeners using microscopic models with different distance measures in an automatic speech recognizer
In this study, the recognition rates of consonants in a vowel-consonant-vowel structure are investigated in hearing tests and with two microscopic models. Such a syllable structure does not exist in Farsi or Azerbaijani, but since the goal is only recognition of the middle phoneme, according to hearing tests, listeners are able to properly recognize phonemes in clean speech conditions....
On the amount of speech data necessary for successful speaker identification
The paper deals with the dependence between the speaker identification performance and the amount of test data. Three speaker identification procedures based on hidden Markov models (HMMs) of phonemes are presented here. One, which is quite commonly used in the speaker recognition systems based on HMMs, uses the likelihood of the whole utterance for speaker identification. The other two that ar...
A Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
Combining Gaussian Mixture Models and Segmental Feature Models for Speaker Recognition
In most speaker recognition systems, speech utterances are not constrained in content or language. In a text-dependent speaker recognition system, the lexical content of the speech and the language are known in advance. The goal of this paper is to show that this information can be used by a segmental features (SF) approach to improve a standard Gaussian mixture model with MFCC features (GMM-MFCC). Speech fe...
Publication date: 2015